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What is claimed is: 

1. A method for calculating the similarity of at least one 
chemical compound to at least one chemical probe, the at least one 
chemical probe including at least another chemical compound, the 
method comprising Ahe steps of: 

(a) creating at least one chemical descriptor for each 
compound in a collection of compounds; 

(b) representing\ at least one chemical descriptor for each 
compound as at least one vector comprising at least one descriptor 
frequencies; \ 

(c) representing the collection of compound the at least one 
vector as a first vector of a molecule-descriptor matrix; 

(d) performing singular value decomposition of the molecule- 
descriptor matrix to produce\at least one singular matrix; 

(e) generating at least orie chemical probe descriptor for the 
at least one chemical probe; \ 

(f ) using the at least one singular matrix to transform the at 
least one chemical probe descriptor\of the at least one chemical 
probe into a first coordinate system an least substantially similar 
to a second coordinate system of the at\ least one compound; 
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(g) calculating the similarity of transformed probes to the 
compounds in the collection, and 

(h) outputting a list of at least a subset of compounds in the 
collection ranked in order of similarity to the at least one probe. 



2. The method as recited in claim 1, wherein said step of 
creating at least one descriptor includes generating atom pair and 
topological torsion descriptors from chemical connection tables of 
the collection of compounds, 



The method as 



^ed in claim 1, wherein said step of 



creating at least one descriptor includes creating an index of 
descriptors and an indexySf \compounds in the collection. 



4. The method as Vecited in claim 1, wherein said molecule- 
descriptor matrix is denoted as X, 

wherein said step of \perf orming singular value decomposition 
includes generating singular matrices as X = PEQ T of rank r, and a 
reduced dimension approximation of X defined as X k = P k E k Q T k k<<r, 
where P and Q are the left andYright singular matrices representing 
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correlations among descriptors and compounds respectively, and 2 
represents the singular values, 

wherein the alt least one produced singular matrix includes a 
pseudo-object denoted as 0 F and is calculated from a probe F by 
0 F = F^E'^ , and 

wherein said step of calculating the similarity between the 
pseudo-object 0 F and nhe compounds in collection is computed by 
taking a dot product \of a normalized vector of 0 F with each 
normalized row of P k . 



5. The method as recited to claim 4, wherein said similarity 
calculating step includes calculating cosine between each pair of 



vectors . 



15 



6. The method as recited in claim 4, wherein said step of 
performing singular value decomposition includes deriving the 
reduced dimensional approximation of ^ by setting the k+1 through 
r singular values of £ to zero. 
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7. The method as recited in claim 4, wherein similarities of 
the pseudo-object to compounds in the collection is calculated by 
setting the first k singular values of £ to one. 



8. The method as reodted in claim 7, wherein said setting 
step includes using an identity matrix I. 
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9. A method of generating a searchable representation of 
chemical structures comprising : 

(a) generating an indejaof unique features; 

(b) generating a f eatureVerFiemica\ structure matrix including 
vectors that describe the ichemMal structures; and 



(c) determining correlatio: 



on the generated feature-chemicalA structure/ matrix for generating 



15 the searchable representation c 



*een chemical structures based 



f the chemi/cal structures, 



10. The method according co claim 9, wherein the index of 



unique features include chemical descriptors. 
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11. The metopd according to claim 9, further comprising 
generating the chemical descriptors from connection tables prior to 
said index-generating step (a) . 



12. The method according to claim 9, wherein said determining 
step (c) includes performing singular value decomposition of the 
feature -chemical struc 



13. The method acco 



descriptors include 



topological torsion 



hydrophobic pair descripto 
and geometry descriptors. 




o claim 9, wherein the chemical 



of atom pair descriptors, 



harge pair descriptors, 
atom property descriptors; 



14. A computer readable meddum including instructions being 
executable by a computer, the instructions instructing the computer 
to generate a searchable representation of chemical structures, the 
instructions comprising : 

(a) generating an index of uniqu^y features ; 
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(b) generating a feature-chemical structure matrix including 
vectors that describe the chemical structures; and 



(c) determining 



on the generated 



lations between chemical structures based 



re-chemical structure matrix for generating 



the searchable representation of the chemical structures, 



15. The computer\ readable medium according to claim 14 , 
wherein the index of unique features include chemical descriptors. 



16. The compute 



further comprising 
connection tables prior 

17. The computer^ 
wherein said determinin 
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e medium according to claim 14, 



chemical descriptors from 



generating step (a) 



e medium according to claim 14, 
,(c) includes performing singular 



value decomposition of the feature -chemical structure matrix. 

18. The computer readable medium according to claim 14, 
wherein the chemical descriptors include at least one of atom pair 
descriptors, topological torsion descriptors, charge pair 
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descriptors, hydrophobic pair descriptors, inherent atom property- 
descriptors; and geometry descriptors. 



19. The computer readable medium according to claim 16, 
wherein the instruction^ further comprise the steps of: 

determining whethen a user has input a query compound probe; 
generating chemical ^descriptors for the query compound probe; 
calculating similarities between the chemical descriptors for 



the query compound prob 



U3 chemical structures; 



ranking the chem 



compound probe . 



20. The computer r 




he searchable representation of the 



Is by similarity to the query 



medium according to claim 19, 



15 wherein the instructions further aomprise the step of: 

modifying the query compound \ probe based on the generated 
chemical descriptors for the query compound probe. 
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